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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Robert G. Ulrich 



(ii) TITLE OF INVENTION: Bacterial Superantigen 
Vaccines 

(iii) NUMBER OF SEQUENCES: 31 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Elizabeth Arwine 

(B) STREET: US Army MRMC -504 Scott Street 

MCMR-JA (Elizabeth Arwine -Patent 

Atty) 

(C) CITY: FORT DETRICK 

(D) STATE: MARYLAND 

(E) COUNTRY: USA 

(F) ZIP: 21702-5012 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: Apple Macintosh 

(C) OPERATING SYSTEM: Macintosh 7.5 

(D) SOFTWARE: Microsoft Word 6.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/882,431 

(B) FILING DATE: June 25, 1997 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Sana A. Pratt 

(B) REGISTRATION NUMBER: 39,441 

(C) REFERENCE /DOCKET NUMBER: 

(ix) TELECOMMUNICATION INFORMATION 

(A) TELEPHONE: (301) 619-2065 

(B) TELEFAX: (301) 619-7714 



(2) INFORMATION FOR SEQUENCE ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Unknown 

(D) TOPOLOGY: Unknown 
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(D) TOPOLOGY: Unknown 
(ii) Molecule type: DNA 





(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 






ATGAAAAAAA 


CAGCATTTAC 


ATTACTTTTA 


TTCATTGCCC 


40 




TAACGTTGAC 


AACAAGTCCA 


CTTGTAAATG 


GTAGCGAGAA 


80 




AAGCGAAGAA 


ATAAATGAAA 


AAGATTTGCG 


AAAAAAGTCT 


120 




GAATTGCAGG 


GAACAGCTTT 


AGGCAATCTT 


AAACAAATCT 


160 




ATTATTACAA 


TGAAAAAGCT 


AAAACTGAAA 


ATAAAGAGAG 


200 




TCACGATCAA 


TTTCGACAGC 


ATACTATATT 


GTTTAAAGGC 


240 




TTTTTTACAG 


ATCATTCGTG 


GTATAACGAT 


TTATTAGTAC 


280 




GTTTTGATTC 


AAAGGATATT 


GTTGATAAAT 


ATAAAGGGAA 


320 




AAAAGTAGAC 


TTGTATGGTG 


CTTATGCTGG 


TTATCAATGT 


360 




GCGGGTGGTA 


CACCAAACAA 


AACAGCTTGT 


ATGTATGGTG 


400 


|1J 


GTGTAACGTT 


ACATGATAAT 


AATCGATTGA 


CCGAAGAGAA 


440 




AAAAGTGCCG 


ATCAATTTAT 


GGCTAGACGG 


TAAACAAAAT 


480 




ACAGTACCTT 


TGGAAACGGT 


TAAAACGAAT 


AAGAAAAATG 


520 


m 


TAACTGTTCA 


GGAGTTGGAT 


CTTCAAGCAA 


GACGTTATTT 


560 




ACAGGAAAAA 


TATAATTTAT 


ATAACTCTGA 


TGTTTTTGAT 


600 




GGGAAGGTTC 


AGAGGGGATT 


AATCGTGTTT 


CATACTTCTA 


640 




CAGAACCTTC 


GGTTAATTAC 


GATTTATTTG 


GTGCTCAAGG 


680 




ACAGTATTCA 


AATAC AC TAT 


TAAGAATATA 


TAGAGATAAT 


720 




AAAACGATTA 


ACTCTGAAAA 


CATGCATATT 


GATATATATT 


760 




TATATACAAG 


TTAAACATGG 


TAGTTTTGAC 


CAACGTAATG 


800 




TTCAGATTAT 


TATGAACCGA 


GAATAATCTA 


830 



( 3 ) INFORMATION FOR SEQUENCE ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 257 
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(B) TYPE: Amino Acid 

(C) STRANDEDNESS : Unknown 

(D) TOPOLOGY: Unknown 
(ii) Molecule type: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Lys Lys Thr Ala Phe Thr Leu Leu Leu 
15 10 

Phe lie Ala Leu Thr Leu Thr Thr Ser Pro 

15 20 

Leu Val Asn Gly Ser Glu Lys Ser Glu Glu 

25 30 

lie Asn Glu Lys Asp Leu Arg Lys Lys Ser 
35 40 

Glu Leu Gin Gly Thr Ala Leu Gly Asn Leu 

45 50 

Lys Gin lie Tyr Tyr Tyr Asn Glu Lys Ala 

55 60 

Lys Thr Glu Asn Lys Glu Ser His Asp Gin 

65 70 

Phe Arg Gin His Thr lie Leu Phe Lys Gly 

75 80 

Phe Phe Thr Asp His Ser Trp Tyr Asn Asp 

85 90 

Leu Leu Val Arg Phe Asp Ser Lys Asp lie 

95 100 

Val Asp Lys Tyr Lys Gly Lys Lys Val Asp 

105 110 

Leu Tyr Gly Ala Tyr Ala Gly Tyr Gin Cys 

115 120 

Ala Gly Gly Thr Pro Asn Lys Thr Ala Cys 

125 130 

Met Tyr Gly Gly Val Thr Leu His Asp Asn 

135 140 

Asn Arg Leu Thr Glu Glu Lys Lys Val Pro 

145 150 

lie Asn Leu Trp Leu Asp Gly Lys Gin Asn 

155 160 
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Thr Val Lys Thr Asn 

170 



Val Gin Glu Leu Asp 

180 

Tyr Leu Gin Glu Lys 

190 

Ser Asp Val Phe Asp 

200 

Gly Leu He Val Phe 

210 

Pro Ser Val Asn Tyr 
220 

Gin Gly Gin Tyr Ser 

230 

He Tyr Arg Asp Asn 
240 

Glu Asn Met His He 

250 

Thr Ser 



5S 3S: 
til 

^ ( 4 ) INFORMATION FOR SEQUENCE ID NO : 3 : 

O (i) SEQUENCE CHARACTERISTICS: 

M- (A) LENGTH: 757 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Unknown 

(D) TOPOLOGY: Unknown 
(ii) Molecule type: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
ATGAGAAAAG C GAAGAAAT A AATGAAAAAG ATTTGCGAAA 4 0 



AAAGTCTGAA TTGCAGGGAA CAGCTTTAGG CAATCTTAAA 80 



CAAATCTATT ATTACAATGA AAAAGCTAAA AC TGAAAAT A 12 0 



AAGAGAGTCA CGATCAATTT CGACAGCATA CTATATTGTT 160 



TAAAGGCTTT TTTACAGATC ATTCGTGGTA TAACGATTTA 2 00 



Thr Val Pro Leu Glu 

165 

Lys Lys Asn Val Thr 

175 

Leu Gin Ala Arg Arg 

185 

Tyr Asn Leu Tyr Asn 

195 

Gly Lys Val Gin Arg 

205 

His Thr Ser Thr Glu 

215 

Asp Leu Phe Gly Ala 

225 

Asn Thr Leu leu Arg 

235 

Lys Thr lie Asn Ser 

245 

Asp lie Tyr Leu Tyr 

255 
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TTAGTACGTT TTGATTCAAA GGATATTGTT GATAAATATA 240 

AAGGGAAAAA AGTAGACTTG TATGGTGCTT ATGCTGGTTA 280 

TCAATGTGCG GGTGGTACAC CAAACAAAAC AGCTTGTATG 32 0 

TATGGTGGTG TAACGTTACA TGATAATAAT CGATTGACCG 3 60 

AAGAGAAAAA AGTGCCGATC AATTTATGGC TAGACGGTAA 400 

ACAAAATACA GTACCTTTGG AAACGGTTAA AACGAATAAG 440 

AAAAATGTAA CTGTTCAGGA GTTGGATCTT CAAGCAAGAC 480 

GTTATTTACA GGAAAAATAT AATTTATATA ACTCTGATGT 520 

TTTTGATGGG AAGGTTCAGA GGGGATTAAT CGTGTTTCAT 560 

ACTTCTACAG AACCTTCGGT TAATTACGAT TTATTTGGTG 600 

CTCAAGGACA GTATTCAAAT ACACTATTAA GAATATATAG 640 

AGATAATAAA ACGATTAACT CTGAAAACAT GCATATTGAT 680 

ATATATTTAT ATACAAGTTA AACATGGTAG TTTTGACCAA 720 

CGTAATGTTC AGATTATTAT GAACCGAGAA TAATCTA 757 

(5) INFORMATION FOR SEQUENCE ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS : Unknown 

(D) TOPOLOGY: Unknown 

(ii) Molecule type: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Glu Lys Ser Glu Glu lie Asn Glu Lys 

5 10 

Asp Leu Arg Lys Lys Ser Glu Leu Gin Gly 

15 20 

Thr Ala Leu Gly Asn Leu Lys Gin lie Tyr 

25 30 

Tyr Tyr Asn Glu Lys Ala Lys Thr Glu Asn 

35 40 

Lys Glu Ser His Asp Gin Phe Arg Gin His 

45 50 
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Thr He Leu Phe Lys Gly Phe Phe Thr Asp 

55 60 

His Ser Trp Tyr Asn Asp Leu Leu Val Arg 

65 70 

Phe Asp Ser Lys Asp He Val Asp Lys Tyr 

75 80 

Lys Gly Lys Lys Val Asp Leu Tyr Gly Ala 

85 90 

Tyr Ala Gly Tyr Gin Cys Ala Gly Gly Thr 

95 100 

Pro Asn Lys Thr Ala Cys Met Tyr Gly Gly 

105 HO 

Val Thr Leu His Asp Asn Asn Arg Leu Thr 

115 120 

Glu Glu Lys Lys Val Pro He Asn Leu Trp 

125 130 

Leu Asp Gly Lys Gin Asn Thr Val Pro Leu 

135 140 

Glu Thr Val Lys Thr Asn Lys Lys Asn Val 

145 150 

Thr Val Gin Glu Leu Asp Leu Gin Ala Arg 

155 160 

Arg Tyr Leu Gin Glu Lys Tyr Asn Leu Tyr 

165 170 

Asn Ser Asp Val Phe Asp Gly Lys Val Gin 

175 180 

Arg Gly Leu He Val Phe His Thr Ser Thr 

185 190 

Glu Pro Ser Val Asn Tyr Asp Leu Phe Gly 

195 200 

Ala Gin Gly Gin Tyr Ser Asn Thr Leu Leu 

205 210 

Arg He Tyr Arg Asp Asn Lys Thr He Asn 

215 22 0 
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Ser Glu Asn Met His He Asp He Tyr Leu 

225 230 

Tyr Thr Ser 



(6) INFORMATION FOR SEQUENCE ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1712 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Unknown 

(D) TOPOLOGY: Unknown 

(ii) Molecule type: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GAACTAGGTA GAAAAATAAT TATGAGAAAA CACTATGTTG 40 

TTAAAGATGT TTTCGTATAT AAGTTTAGGT GATGTATAGT 80 

TACTTAATTT TAAAAGCATA ACTTAATTAA TATAAATAAC 12 0 

ATGAGATTAT TAAATATAAT TAAGTTTCTT TTAATGTTTT 160 

TTTAATTGAA TATTTAAGAT TATAACATAT ATTTAAAGTG 2 00 

TATCTAGATA CTTTTTGGGA ATGTTGGATA AAGGAGATAA 2 40 

AAAATGTATA AGAGATTATT TATTTCACAT GTAATTTTGA 2 80 

TATTCGCACT GATATTAGTT ATTTCTACAC CCAACGTTTT 320 

AGCAGAGAGT CAACCAGATC CTAAACCAGA TGAGTTGCAC 3 60 

AAATCGAGTA AATTCACTGG TTTGATGGAA GATATGAAAG 400 

TTTTGTATGA TGATAATCAT GTATCAGCAA TAAACGTTAA 440 

ATCTATAGAT CAATTTCTAT ACTTTGACTT AATATATTCT 48 0 

ATTAAGGACA CTAAGTTAGG GGATTATGAT AATGTTCGAG 520 

TCGAATTTAA AAACAAAGAT TTAGCTGATA AATACAAAGA 5 60 

TAAATACGTA GATGTGTTTG GAGCTAATTA TTATTATCAA 600 

TGTTATTTTT CTAAAAAAAC GAATGATATT AATTCGCATC 640 

AAACTGACAA AC GAAAAACT TGTATGTATG GTGGTGTAAC 680 
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TGAGCATAAT GGAAACCAAT TAGATAAATA TAGAAGTATT 72 0 

ACTGTTCGGG TATTTGAAGA TGGTAAAAAT TTATTATCTT 7 60 

TTGACGTACA AACTAATAAG AAAAAGGTGA CTGCTCAAGA 800 

ATTAGATTAC CTAACTCGTC ACTATTTGGT GAAAAATAAA 840 

AAACTCTATG AATTTAACAA CTCGCCTTAT GAAACGGGAT 88 0 

ATATTAAATT TATAGAAAAT GAGAATAGCT TTTGGTATGA 92 0 

CATGATGCCT GCACCAGGAG ATAAATTTGC CCAATCTAAA 960 

TATTTAATGA TGTACAATGA CAATAAAATG GTTGATTCTA 1000 

AAGATGTGAA GATTGAAGTT TATCTTACGA CAAAGAAAAA 1040 

GTGAAATTAT ATTTTAGAAA AGTAAATATG AAGAGTTAGT 1080 

AATTAAGGCA GGCACTTATA GAGTACCTGC CTTTTCTAAT 112 0 

ATTATTTAGT TATAGTTATT TTTGTTATAT CTCTCTGATT 1160 

TAGCATTAAC CCCTTGTTGC CATTATAGTT TTCACCAACT 12 00 

TTAGCTGAAA TTGGGGGATC ATTTTTATCT TTACTATGGA 1240 

TAGTTACTGT GTCGCCGTTT TTAACGATTT GTTTCTCTTT 1280 

TAATTTGTCA GTTAATTTTT TCCATGCATC ATTTGCGTCA 132 0 

AACCTATTTC CATTTGGATT TATTCTTGAC AAATCAATTC 13 60 

TTTTAACACT ATCGGTATTA ATCGGCTTGT TATTAAAATT 140 0 

ACTAAGTTCA TCTAAATCAG CTGTACCCGT AATACTACTT 1440 

TCGCCACCAT TATTTAAATT GTACGTAACA CCAACTGTCT 1480 

CATTTGCTGT TTTATCGATA ATATTTGCTT CTTTCAAAGC 152 0 

ATCTCTTACA TTTTTCCATA AGTCTCTATC TGTTATTTCA 156 0 

GAAGCCTTTG CAACGTTATT AAT AC C ATT A TAATTTGAAG 16 00 

AAGAATGAAA ACCTGAACCT ACTGTTGTTA AAACTAAAGC 1640 

ACTTGCTATC AATGTTCTTG TTAATAGTTT TTTATTCATT 1680 

TTATTTTCTC CTATAACTTA TTTGCAATCG AT 1712 



9 



INFORMATION FOR SEQUENCE ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 266 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS : Unknown 

(D) TOPOLOGY: Unknown 

(ii) Molecule type: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Tyr Lys Arg Leu Phe lie Ser His Val 

5 10 

lie Leu lie Phe Ala Leu lie Leu Val lie 

15 20 

Ser Thr Pro Asn Val Leu Ala Glu Ser Gin 

25 30 

Pro Asp Pro Lys Pro Asp Glu Leu His Lys 

35 40 

Ser Ser Lys Phe Thr Gly Leu Met Glu Asp 

45 50 

Met Lys Val Leu Tyr Asp Asp Asn His Val 

55 60 

Ser Ala He Asn Val Lys Ser He Asp Gin 

65 70 

Phe Leu Tyr Phe Asp Leu He Tyr Ser He 

75 80 

Lys Asp Thr Lys Leu Gly Asp Tyr Asp Asn 

85 90 

Val Arg Val Glu Phe Lys Asn Lys Asp Leu 

95 100 

Ala Asp Lys Tyr Lys Asp Lys Tyr Val Asp 

105 HO 

Val Phe Gly Ala Asn Tyr Tyr Tyr Gin Cys 

115 120 

Tyr Phe Ser Lys Lys Thr Asn Asp He Asn 

125 13 0 

Ser His Gin Thr Asp Lys Arg Lys Thr Cys 

135 140 
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Met Tyr Gly Gly Val Thr Glu His Asn Gly 

145 150 

Asn Gin Leu Asp Lys Tyr Arg Ser He Thr 

155 160 

Val Arg Val Phe Glu Asp Gly Lys Asn Leu 

165 170 

Leu Ser Phe Asp Val Gin Thr Asn Lys Lys 

175 180 

Lys Val Thr Ala Gin Glu Leu Asp Tyr Leu 

185 190 

Thr Arg His Tyr Leu Val Lys Asn Lys Lys 

195 200 

Leu Tyr Glu Phe Asn Asn Ser Pro Tyr Glu 

205 210 

Thr Gly Tyr He Lys Phe He Glu Asn Glu 

215 220 

Asn Ser Phe Trp Tyr Asp Met Met Pro Ala 

225 230 

Pro Gly Asp Lys Phe Ala Gin Ser Lys Tyr 

235 240 

Leu Met Met Tyr Asn Asp Asn Lys Met Val 

245 250 

Asp Ser Lys Asp Val Lys He Glu Val Tyr 

255 260 

Leu Thr Thr Lys Lys Lys 

265 



(8) INFORMATION FOR SEQUENCE ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1712 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Unknown 

(D) TOPOLOGY: Unknown 

(ii) Molecule type: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GAACTAGGTA GAAAAATAAT TATGAGAAAA CACTATGTTG 4 0 



I 
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TTAAAGATGT TTTCGTATAT AAGTTTAGGT GATGTATAGT 80 

TACTTAATTT TAAAAGCATA ACTTAATTAA TATAAATAAC 120 

ATGAGATTAT TAAATATAAT TAAGTTTCTT TTAATGTTTT 160 

TTTAATTGAA TATTTAAGAT TATAACATAT ATTTAAAGTG 200 

TATCTAGATA CTTTTTGGGA ATGTTGGATA AAGGAGATAA 2 40 

AAAATGTATA AGAGATTATT TATTTCACAT GTAATTTTGA 280 

TATTCGCACT GATATTAGTT ATTTCTACAC CCAACGTTTT 320 

AGCAGAGAGT CAACCAGATC CTAAACCAGA TGAGTTGCAC 3 60 

AAATCGAGTA AATTCACTGG TTTGATGGAA AATATGAAAG 400 

TTTTGTATGA TGATAATCAT GTATCAGCAA TAAACGTTAA 440 

ATCTATAGAT CAATTTCGAT ACTTTGACTT AATATATTCT 480 

ATTAAGGACA CTAAGTTAGG GAATTATGAT AATGTTCGAG 520 

TCGAATTTAA AAACAAAGAT TTAGCTGATA AATACAAAGA 560 

TAAATACGTA GATGTGTTTG GAGCTAATGC TTATTATCAA 600 

TGTGCTTTTT CTAAAAAAAC GAATGATATT AATTCGCATC 640 

AAACTGACAA ACGAAAAACT TGTATGTATG GTGGTGTAAC 680 

TGAGCATAAT GGAAACCAAT TAGATAAATA TAGAAGTATT 720 

ACTGTTCGGG TATTTGAAGA TGGTAAAAAT TTATTATCTT 760 

TTGACGTACA AACTAATAAG AAAAAGGTGA CTGCTCAAGA 800 

ATTAGATTAC CTAACTCGTC ACTATTTGGT GAAAAATAAA 840 

AAACTCTATG AATTTAACAA CTCGC CTTAT GAAACGGGAT 880 

ATATTAAATT TATAGAAAAT GAGAATAGCT TTTGGTATGA 92 0 

CATGATGCCT GCACCAGGAG ATAAATTTGA CCAATCTAAA 9 60 

TATTTAATGA TGTACAATGA CAATAAAATG GTTGATTCTA 1000 

AAGATGTGAA GATTGAAGTT TATCTTACGA CAAAGAAAAA 1040 

GTGAAATTAT ATTTTAGAAA AGTAAATATG AAGAGTTAGT 1080 

AATTAAGGCA GGCACTTATA GAGTACCTGC CTTTTCTAAT 1120 
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ATTATTTAGT TATAGTTATT TTTGTTATAT CTCTCTGATT 116 0 

TAGCATTAAC CCCTTGTTGC CATTATAGTT TTCACCAACT 12 00 

TTAGCTGAAA TTGGGGGATC ATTTTTATCT TTACTATGGA 1240 

TAGTTACTGT GTCGCCGTTT TTAACGATTT GTTTCTCTTT 12 80 

TAATTTGTCA GTTAATTTTT TCCATGCATC ATTTGCGTCA 132 0 

AACCTATTTC CATTTGGATT TATTCTTGAC AAATCAATTC 13 60 

TTTTAACACT ATCGGTATTA ATCGGCTTGT TATTAAAATT 14 00 

ACTAAGTTCA TCTAAATCAG CTGTACCCGT AATACTACTT 1440 

TCGCCACCAT TATTTAAATT GTACGTAACA CCAACTGTCT 1480 

CATTTGCTGT TTTATCGATA ATATTTGCTT CTTTCAAAGC 1520 

ATCTCTTACA TTTTTCCATA AGTCTCTATC TGTTATTTCA 1560 

GAAGCCTTTG CAACGTTATT AATACCATTA TAATTTGAAG 1600 

AAGAATGAAA ACCTGAACCT ACTGTTGTTA AAACTAAAGC 1640 

ACTTGCTATC AATGTTCTTG TTAATAGTTT TTTATTCATT 1680 

TTATTTTCTC CTATAACTTA TTTGCAATCG AT 1712 



INFORMATION FOR SEQUENCE ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 266 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS : Unknown 

(D) TOPOLOGY: Unknown 

(ii) Molecule type: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Tyr Lys Arg Leu Phe lie Ser His Val 

5 10 



lie Leu lie Phe Ala Leu lie Leu Val lie 

15 20 



Ser Thr Pro Asn Val Leu Ala Glu Ser Gin 

25 30 
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Pro Asp Pro Lys Pro Asp Glu Leu His Lys 

35 40 

Ser Ser Lys Phe Thr Gly Leu Met Glu Asn 

45 50 

Met Lys Val Leu Tyr Asp Asp Asn His Val 

55 60 

Ser Ala He Asn Val Lys Ser He Asp Gin 

65 ~ 70 

Phe Arg Tyr Phe Asp Leu He Tyr Ser He 

75 80 

Lys Asp Thr Lys Leu Gly Asn Tyr Asp Asn 

85 90 

Val Arg Val Glu Phe Lys Asn Lys Asp Leu 

95 100 

Ala Asp Lys Tyr Lys Asp Lys Tyr Val Asp 

105 HO 

Val Phe Gly Ala Asn Ala Tyr Tyr Gin Cys 

115 120 

Ala Phe Ser Lys Lys Thr Asn Asp He Asn 

125 130 

Ser His Gin Thr Asp Lys Arg Lys Thr Cys 

135 140 

Met Tyr Gly Gly Val Thr Glu His Asn Gly 

145 150 

Asn Gin Leu Asp Lys Tyr Arg Ser He Thr 

155 160 

Val Arg Val Phe Glu Asp Gly Lys Asn Leu 

165 170 

Leu Ser Phe Asp Val Gin Tyr Asn Lys Lys 

175 180 

Lys Val Thr Ala Gin Glu Leu Asp Tyr Leu 

185 190 

Thr Arg His Tyr Leu Val Lys Asn Lys Lys 

195 200 

Leu Tyr Glu Phe Asn Asn Ser Pro Tyr Glu 

205 210 
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Thr Gly Tyr lie Lys Phe lie Glu Asn Glu 

215 220 

Asn Ser Phe Trp Tyr Asp Met Met Pro Ala 

225 230 

Pro Gly Asp Lys Phe Asp Gin Ser Lys Tyr 

235 ' 240 

Leu Met Met Tyr Asn Asp Asn Lys Met Val 

245 250 

Asp Ser Lys Asp Val Lys He Glu Val Tyr 

255 260 

Leu Thr Thr Lys Lys Lys 

265 



(10) INFORMATION FOR SEQUENCE ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1388 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Unknown 

(D) TOPOLOGY: Unknown 

(ii) Molecule type: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

ATGAGTCAAC CAGATCCTAA ACCAGATGAG TTGCACAAAT 40 

CGAGTAAATT CACTGGTTTG ATGGAAAATA TGAAAGTTTT 80 

GTATGATGAT AATCATGTAT CAGCAATAAA CGTTAAATCT 120 

ATAGATCAAT TTCGATACTT TGACTTAATA TATTCTATTA 160 

AGGACACTAA GTTAGGGAAT TATGATAATG TTCGAGTCGA 2 00 

ATTTAAAAAC AAAGATTTAG CTGATAAATA CAAAGATAAA 240 

TACGTAGATG TGTTTGGAGC TAATGCTTAT TATCAATGTG 2 80 

CTTTTTCTAA AAAAACGAAT GATATTAATT CGCATCAAAC 32 0 

TGACAAACGA AAAACTTGTA TGTATGGTGG TGTAACTGAG 360 

CATAATGGAA ACCAATTAGA TAAATATAGA AGTATTACTG 400 

TTCGGGTATT TGAAGATGGT AAAAATTTAT TATCTTTTGA 440 
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CGTACAAACT AATAAGAAAA AGGTGACTGC TCAAGAATTA 480 

GATTACCTAA CTCGTCACTA TTTGGTGAAA AATAAAAAAC 52 0 

TCTATGAATT TAACAACTCG CCTTATGAAA CGGGATATAT 560 

TAAATTTATA GAAAATGAGA ATAGCTTTTG GTATGACATG 600 

ATGCCTGCAC CAGGAGATAA ATTTGACCAA TCTAAATATT 640 

TAATGATGTA CAATGACAAT AAAATGGTTG ATTCTAAAGA 680 

TGTGAAGATT GAAGTTTATC TTACGACAAA GAAAAAGTGA 720 

AATTATATTT TAGAAAAGTA AATATGAAGA GTTAGTAATT 760 

AAGGCAGGCA CTTATAGAGT ACCTGCCTTT TCTAATATTA 800 

TTTAGTTATA GTTATTTTTG TTATATCTCT CTGATTTAGC 840 

ATTAACCCCT TGTTGCCATT ATAGTTTTCAC CAACTTTAG 880 

CTGAAATTGG GGGATCATTT TTATCTTTAC TATGGATAGT 92 0 

TACTGTGTCG CCGTTTTTAA CGATTTGTTT CTCTTTTAAT 960 

TTGTCAGTTA ATTTTTTCCA TGCATCATTT GCGTCAAACC 1000 

TATTTCCATT TGGATTTATT CTTGACAAAT CAATTCTTTT 1040 

AACACTATCG GTATTAATCG GCTTGTTATT AAAATTACTA 1080 

AGTTCATCTA AATCAGCTGT ACCCGTAATA CTACTTTCGC 1120 

CACCATTATT TAAATTGTAC GTAACACCAA CTGTCTCATT 1160 

TGCTGTTTTA TCGATAATAT TTGCTTCTTT CAAAGCATCT 12 00 

CTTACATTTT TCCATAAGTC TCTATCTGTT ATTTCAGAAG 12 40 

CCTTTGCAAC GTTATTAATA CCATTATAAT TTGAAGAAGA 12 80 

ATGAAAACCT GAACCTACTG TTGTTAAAAC TAAAGCACTT 1320 

GCTATCAATG TTCTTGTTAA TAGTTTTTTA TTCATTTTAT 13 6 0 

TTTCTCCTAT AACTTATTTG CAATCGAT 13 8 8 

(11) INFORMATION FOR SEQUENCE ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 239 



16 



(B) TYPE: Amino Acid 

(C) STRANDEDNESS : Unknown 

(D) TOPOLOGY: Unknown 

(ii) Molecule type: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ser Gin Pro Asp Pro Lys Pro Asp Glu 

5 10 

Leu His Lys Ser Ser Lys Phe Thr Gly Leu 

15 20 

Met Glu Asn Met Lys Val Leu Tyr Asp Asp 

25 30 

Asn His Val Ser Ala He Asn Val Lys Ser 

35 40 

He Asp Gin Phe Arg Tyr Phe Asp Leu He 

45 50 

Tyr Ser He Lys Asp Thr Lys Leu Gly Asn 

55 60 

Tyr Asp Asn Val Arg Val Glu Phe Lys Asn 

65 70 

Lys Asp Leu Ala Asp Lys Tyr Lys Asp Lys 

75 80 

Tyr Val Asp Val Phe Gly Ala Asn Ala Tyr 

85 90 

Tyr Gin Cys Ala Phe Ser Lys Lys Thr Asn 

95 100 

Asp He Asn Ser His Gin Thr Asp Lys Arg 

105 HO 

Lys Thr Cys Met Tyr Gly Gly Val Thr Glu 

115 " 120 

His Asn Gly Asn Gin Leu Asp Lys Tyr Arg 

125 130 

Ser He Thr Val Arg Val Phe Glu Asp Gly 

135 140 

Lys Asn Leu Leu Ser Phe Asp Val Gin Thr 

145 150 

Asn Lys Lys Lys Val Thr Ala Gin Glu Leu 
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155 160 

Asp Tyr Leu Thr Arg His Tyr Leu Val Lys 

165 170 

Asn Lys Lys Leu Tyr Glu Phe Asn Asn Ser 

175 180 

Pro Tyr Glu Thr Gly Tyr He Lys Phe He 

185 190 

Glu Asn Glu Asn Ser Phe Trp Tyr Asp Met 

195 200 

Met Pro Ala Pro Gly Asp Lys Phe Asp Gin 

205 210 

Ser Lys Tyr Leu Met Met Tyr Asn Asp Asn 

215 220 

Lys Met Val Asp Ser Lys Asp Val Lys He 

225 230 

Glu Val Tyr Leu Thr Thr Lys Lys Lys 

235 



(12) INFORMATION FOR SEQUENCE ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 731 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Unknown 

(D) TOPOLOGY: Unknown 

(ii) Molecule type: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

TAAGGAGAAT TAAAAATGAA TAAAAAATTA CTAATGAATT 40 

TTTTTATCGT AAGCCCTTTG TTGCTTGCGA CAACTGCTAC 80 

AGATTTTACC CCTGTTCCCT TATCATCTAA TCAAATAATC 120 

AAAACTGCAA AAGCATCTAC AAACGATAAT ATAAAGGATT 160 

TGCTAGACTG GTATAGTAGT GGGTCTGACA CTTTTACAAA 2 00 



TAGTGAAGTT TTAGATAATT CCAGAGGATC TATGCGTATA 
AAAAACACAG ATGGCAGCAT CAGCTTGATA ATTTTTCCGA 



240 
280 



u 



X 
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GTCCTTATTA TAGCCCTGCT TTTACAAAAG GGGAAAAAGT 320 

TGACTTAAAC ACAAAAAGAA CTAAAAAAAG CCAACATACT 360 

AGCGAAGGAA CTTATATCCA TTTCCAAATA AGTGGCGTTA 4 00 

CAAATACTGA AAAATTACCT ACTCCAATAG AACTACCTTT 440 

AAAAGTTAAG GTTCATGGTA AAGATAGCCC CTTAAAGTAT 480 

GGGCCAAAGT TCGATAAAAA ACAATTAGCT ATATCAACTT 52 0 

TAGACTTTGA AATTCGTCAT CAGCTAACTC AAATACATGG 560 

ATTATATCGT TCAAGCGATA AAACGGGTGG TTATTGGAAA 600 

ATAACAATGA ATGACGGATC CACATATCAA AGTGATTTAT 640 

CTAAAAAGTT TGAATACAAT ACTGAAAAAC CACCTATAAA 68 0 

TATTGATGAA ATAAAAACTA TAGAAGCAGA AATTAATTAA 72 0 

TTTACCACTT T 731 



(13) INFORMATION FOR SEQUENCE ID NO : 12 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 234 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS: Unknown 

(D) TOPOLOGY: Unknown 



(ii) Molecule type: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Asn Lys Lys Leu Leu Met Asn Phe Phe 

5 10 

lie Val Ser Pro Leu Leu Leu Ala Thr Thr 

15 20 

Ala Thr Asp Phe Thr Pro Val Pro Leu Ser 

25 30 

Ser Asn Gin lie lie Lys Thr Ala Lys Ala 

35 40 

Ser Thr Asn Asp Asn lie Lys Asp Leu Leu 

45 50 

Asp Trp Tyr Ser Ser Gly Ser Asp Thr Phe 

55 60 
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Thr Asn Ser Glu Val Leu Asp Asn Ser Arg 

65 70 

Gly Ser Met Arg He Lys Asn Thr Asp Gly 

75 80 

Ser He Ser Leu He He Phe Pro Ser Pro 

85 90 

Tyr Tyr Ser Pro Ala Phe Thr Lys Gly Glu 

95 100 

Lys Val Asp Leu Asn Thr Lys Arg Thr Lys 

105 HO 

Lys Ser Gin His Thr Ser Glu Gly Thr Tyr 

115 HO 

He His Phe Gin He Ser Gly Val Thr Asn 

125 130 

Thr Glu Lys Leu Pro Thr Pro He Glu Leu 

135 140 

Pro Leu Lys Val Lys Val His Gly Lys Asp 

145 150 

Ser Pro Leu Lys Tyr Gly Pro Lys Phe Asp 

155 160 

Lys Lys Gin Leu Ala He Ser Thr Leu Asp 
~ ' 165 170 

Phe Glu He Arg His Gin Leu Thr Gin He 

175 180 

His Gly Leu Tyr Arg Ser Ser Asp Lys Thr 

185 190 

Gly Gly Tyr Trp Lys He Thr Met Asn Asp 

195 200 

Gly Ser Thr Tyr Gin Ser Asp Leu Ser Lys 

205 210 

Lys Phe Glu Tyr Asn Thr Glu Lys Pro Pro 

215 220 

He Asn He Asp Glu He Lys Thr He Glu 

225 230 

Ala Glu He Asn 
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-5 



(14) INFORMATION FOR SEQUENCE ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1095 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Unknown 

(D) TOPOLOGY: Unknown 



(ii) Molecule type: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

ATCATTAAAT ATAATTAATT TTCTTTTAAT ATTTTTTTAA 40 

TTGAATATTT AAGATTATAA GATATATTTA AAGTGTATCT 80 

AGATACTTTT TGGGAATGTT GGATGAAGGA GATAAAAATG 12 0 

AATAAGAGTC GATTTATTTC ATGCGTAATT TTGATATTCG 160 

CACTTATACT AGTTCTTTTT ACACCCAACG TATTAGCAGA 20 0 

GAGCCAACCA GACCCTACGC CAGATGAGTT GCACAAAGCG 240 

AGTAAATTCA CTGGTTTGAT GGAAAATATG AAAGTTTTAT 2 80 

ATGATGATCA TTATGTATCA GCAACTAAAG TTAAGTCTGT 32 0 

AGATAAATTT AGGGCACATG ATTTAATTTA TAACATTAGT 360 

GATAAAAAAC TGAAAAATTA TGACAAAGTG AAAACAGAGT 400 

TATTAAATGA AGGTTTAGCA AAGAAGTACA AAGATGAAGT 440 

AGTTGATGTG TATGGATCAA ATTACTATGT AAACTGCTAT 480 

TTTTCATCCA AAGATAATGT AGGTAAAGTT ACAGGTGGCA 520 

AAACTTGTAT GTATGGAGGA ATAACAAAAC ATGAAGGAAA 56 0 

CCACTTTGAT AATGGGAACT TACAAAATGT ACTTATAAGA 600 

GTTTATGAAA ATAAAAGAAA CACAATTTCT TTTGAAGTGC 640 

AAAC TGATAA GAAAAGTGTA ACAGCTCAAG AACTAGACAT 68 0 

AAAAGCTAGG AATTTTTTAA TTAATAAAAA AAATTTGTAT 720 

GAGTTTAACA GTTCACCATA TGAAACAGGA TATATAAAAT 760 



TTATTGAAAA TAACGGCAAT ACTTTTTGGT ATGATATGAT 



800 
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GCCTGCACCA GGCGATAAGT TTGACCAATC TAAATATTTA 84 0 

ATGATGTACA ACGACAATAA AACGGTTGAT TCTAAAAGTG 880 

TGAAGATAGA AGTCCACCTT ACAACAAAGA ATGGATAATG 92 0 

TTAATCCGAT TTTGATATAA AAAGTGAAAG TATTAGATAT 960 

ATTTGAAAGG TAAGTACTTC GGTGCTTGCC TTTTTAGGAT 1000 

GCATATATAT AGATTAAACC GCACTTCTAT ATTAATAGAA 1040 

AGTGCGGTTA TTTATACACT CAATCTAAAC TATAATAATT 1080 

GGAATCATCT TCAAA 1095 

(15) INFORMATION FOR SEQUENCE ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 266 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS : Unknown 

( D ) TOPOLOGY : Unknown 

(ii) Molecule type: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



Met Asn Lys Ser Arg Phe He Ser Cys Val 

5 10 

He Leu He Phe Ala Leu He Leu Val Leu 

15 20 

Phe Thr Pro Asn Val Leu Ala Glu Ser Gin 

25 30 

Pro Asp Pro Thr Pro Asp Glu Leu His Lys 

35 40 

Ala Ser Lys Phe Thr Gly Leu Met Glu Asn 

45 50 

Met Lys Val Leu Tyr Asp Asp His Tyr Val 

55 60 

Ser Ala Thr Lys Val Lys Ser Val Asp Lys 

65 70 

Phe Arg Ala His Asp Leu He Tyr Asn He 

75 80 
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Ser Asp Lys Lys Leu Lys Asn Tyr Asp Lys 

85 90 

Val Lys Thr Glu Leu Leu Asn Glu Gly Leu 

95 100 

Ala Lys Lys Tyr Lys Asp Glu Val Val Asp 

105 110 

Val Tyr Gly Ser Asn Tyr Tyr Val Asn Cys 

115 120 

Tyr Phe Ser Ser Lys Asp Asn Val Gly Lys 

125 " 130 

Val Thr Gly Gly Lys Thr Cys Met Tyr Gly 

135 140 

Gly He Thr Lys His Glu Gly Asn His Phe 

145 150 

Asp Asn Gly Asn Leu Gin Asn Val Leu He 

155 160 

Arg Val Tyr Glu Asn Lys Arg Asn Thr He 

165 170 

Ser Phe Glu Val Gin Thr Asp Lys Lys Ser 

175 180 

Val Thr Ala Gin Glu Leu Asp He Lys Ala 

185 190 

Arg Asn Phe Leu He Asn Lys Lys Asn Leu 

195 200 

Tyr Glu Phe Asn Ser Ser Phe Tyr Glu Thr 

205 210 

Gly Tyr He Lys Phe He Glu Asn Asn Gly 

215 220 

Asn Thr Phe Trp Tyr Asp Met Met Pro Ala 

225 230 

Pro Gly Asp Lys Phe Asp Gin Ser Lys Tyr 

235 240 

Leu Met Met Tyr Asn Asp Asn Lys Thr Val 

245 250 

Asp Ser Lys Ser Val Lys He Glu Val His 

255 260 
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Leu Thr Thr Lys Asn Gly 

265 

(16) INFORMATION FOR SEQUENCE ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1837 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Unknown 

(D) TOPOLOGY: Unknown 

(ii) Molecule type: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

TCATGTTTGA CAGCTTATCA TCGATAAGCT TACTTTTCGA 40 

ATCAGGTCTA TCCTTGAAAC AGGTGCAACA TAGATTAGGG 80 

CATGGAGATT TACCAGACAA CTATGAACGT ATATACTCAC 12 0 

ATCACGCAAT CGGCAATTGA TGACATTGGA ACTAAATTCA 160 

ATCAATTTGT TACTAACAAG CAACTAGATT GACAACTAAT 200 

TCTCAACAAA CGTTAATTTA ACAACATTCA AGTAACTCCC 240 

ACCAGCTCCA TCAATGCTTA CCGTAAGTAA TCATAACTTA 280 

CTAAAACCTT GTTACATCAA GGTTTTTTCT TTTTGTCTTG 320 

TTCATGAGTT ACCATAACTT TC TAT ATT AT TGACAACTAA 3 60 

ATTGACAACT CTTCAATTAT TTTTCTGTCT ACTCAAAGTT 400 

TTCTTCATTT GATATAGTCT AATTCCACCA TCACTTCTTC 440 

CACTCTCTCT ACCGTCACAA CTTCATCATC TCTCACTTTT 480 

TCGTGTGGTA ACACATAATC AAATATCTTT CCGTTTTTAC 52 0 

GCACTATCGC TACTGTGTCA CCTAAAATAT ACCCCTTATC 5 60 

AATCGCTTCT TTAAACTCAT CTATATATAA CATATTTCAT 600 

CCTCCTACCT ATCTATTCGT AAAAAGATAA AAATAACTAT 640 

TGTTTTTTTT GTTATTTTAT AATAAAATTA TTAATATAAG 680 

TTAATGTTTT TTAAAAATAT ACAATTTTAT TCTATTTATA 72 0 



GTTAGCTATT TTTTCATTGT TAGTAATATT GGTGAATTGT 



760 



AATAACCTTT TTAAATCTAG AGGAGAACCC AGATATAAAA 



800 



TGGAGGAATA TTAATGGAAA ACAATAAAAA AGTATTGAAG 



840 



AAAATGGTAT TTTTTGTTTT AGTGACATTT CTTGGACTAA 



880 



CAATCTCGCA AGAGGTATTT GCTCAACAAG ACCCCGATCC 



920 



AAGCCAACTT CACAGATCTA GTTTAGTTAA AAACCTTCAA 



960 



* 




AATATATATT TTCTTTATGA GGGTGACCCT GTTACTCACG 100 0 

AGAATGTGAA ATCTGTTGAT CAACTTAGAT CTCACGATTT 1040 

AATATATAAT GTTTCAGGGC CAAATTATGA TAAATTAAAA 1080 

ACTGAACTTA AGAACCAAGA GATGGCAACT TTATTTAAGG 112 0 

ATAAAAACGT TGATATTTAT GGTGTAGAAT ATTACCATCT 1160 

CTGTTATTTA TGTGAAAATG CAGAAAGGAG TGCATGTATC 1200 

TACGGAGGGG TAACAAATCA TGAAGGGAAT CATTTAGAAA 1240 

TTCCTAAAAA GATAGTCGTT AAAGTATCAA TCGATGGTAT 128 0 

CCAAAGCCTA TCATTTGATA TTGAAACAAA TAAAAAAATG 132 0 

GTAACTGCTC AAGAATTAGA C TAT AAAGTT AGAAAATATC 1360 

TTACAGATAA TAAGCAACTA TATACTAATG GACCTTCTAA 1400 

ATATGAAACT GGATATATAA AGTTCATACC TAAGAATAAA 1440 

GAAAGTTTTT GGTTTGATTT TTTCCCTGAA CCAGAATTTA 1480 

CTCAATCTAA ATATCTTATG ATATATAAAG ATAATGAAAC 1520 

GCTTGACTCA AACACAAGCC AAATTGAAGT CTACCTAACA 1560 

ACCAAGTAAC TTTTTGCTTT TGGCAACCTT ACCTACTGCT 1600 

GGATTTAGAA ATTTTATTGC AATTCTTTTA TTAATGTAAA 1640 

AACCGCTCAT TTGATGAGCG GTTTTGTCTT ATCTAAAGGA 1680 

GCTTTACCTC CTAATGCTGC AAAATTTTAA ATGTTGGATT 1720 

TTTGTATTTG TCTATTGTAT TTGATGGGTA ATCCCATTTT 1760 

TCGACAGACA TCGTCGTGCC ACCTCTAACA CCAAAATCAT 1800 
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AGACAGGAGC TTGTAGCTTA GCAACTATTT TATCGTC 1837 



(17) INFORMATION FOR SEQUENCE ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS : Unknown 

(D) TOPOLOGY: Unknown 
(ii) Molecule type: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Glu Asn Asn Lys Lys Val Leu Lys Lys 

5 10 

Met Val Phe Phe Val Leu Val Thr Phe Leu 

15 20 

Gly Leu Thr lie Ser Gin Glu Val Phe Ala 

25 30 

Gin Gin Asp Pro Asp Pro Ser Gin Leu His 

35 40 

Arg Ser Ser Leu Val Lys Asn Leu Gin Asn 

45 50 

He Tyr Phe Leu Tyr Glu Gly Asp Pro Val 

55 60 

Thr His Glu Asn Val Lys Ser Val Asp Gin 

65 70 

Leu Arg Ser His Asp Leu He Tyr Asn Val 

75 80 

Ser Gly Pro Asn Tyr Asp Lys Leu Lys Thr 

85 90 

Glu Leu Lys Asn Gin Glu Met Ala Thr Leu 

95 100 

Phe Lys Asp Lys Asn Val Asp He Tyr Gly 

105 HO 

Val Glu Tyr Tyr His Leu Cys Tyr Leu Cys 

115 120 

Glu Asn Ala Glu Arg Ser Ala Cys He Tyr 

125 130 

Gly Gly Val Thr Asn His Glu Gly Asn His 
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135 



140 



Leu Glu lie Pro Lys Lys lie Val Val Lys 

145 150 

Val Ser He Asp Gly He Gin Ser Leu Ser 

155 160 

Phe Asp He Glu Thr Asn Lys Lys Met Val 

165 170 

Thr Ala Gin Glu Leu Asp Tyr Lys Val Arg 

175 180 

Lys Tyr Leu Thr Asp Asn Lys Gin Leu Tyr 

185 190 

Thr Asn Gly Pro Ser Lys Tyr Glu Thr Gly 

195 200 

Tyr He Lys Phe He Pro Lys Asn Lys Glu 

205 210 

Ser Phe Trp Phe Asp Phe Phe Pro Glu Pro 
215 220 

Glu Phe Thr Gin Ser Lys Tyr Leu Met He 

225 230 

Tyr Lys Asp Asn Glu Thr Leu Asp Ser Asn 

235 240 



Thr Ser Gin He Glu Val Tyr Leu Thr Thr 

245 250 



Lys 



SEQ ID NO: 17 

sense primer for cloning SpeA 

5' CTCG CAA GAG GTA CAT ATG CAA CAA GAC 3 ' (SEQ ID NO: ) , 
sense primer to introduce a unique Ndel site; 

SEQ ID NO: 18 

antisense primer for cloning SpeA 

5' GCA GTA GGT AAG CTT GCC AAA AGC 3' (SEQ ID NO : ) 



SEQ ID NO: 19 

1» SpeA forward primer , including Ndel site: 



5' G AT AT AC AT ATGC AAC AAGAC C C C G ATC C AAGC C 3' 34-mer 
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SEQIDNO:20 

SpeA reverse primer , adds SpeB overlap 

5' GAGATTTAACAACTGGTTGCTTGGTTGTTAGGTAGAC 3' 37-mer 



SEQ ID NO: 21 

3. SpeB forward primer , adds SpeA overlap: 

5 ' G TC T AC C T AAC AAC C AAGC AACCAGTTGTTAAATCTC 3 ' 
SEQ ID NO: 22 

4. S peB reverse primer ; adds stop site (Amber) and maintains BamHI site: 

5' GAATTCGGATCCGCTAGCCTACAACAG 3' 27-mer 



SEQ ID NO: 23 

SpeA(L42R) -SpeB(C47S) gene insert DNA sequence 
1 

atgcaacaag accccgatcc aagccaactt cacagatcta gtttagttaa 
51 

aaaccttcaa aatatatatt ttctttatga gggtgaccct gttactcacg 
101 

agaatgtgaa atctgttgat caacttcgat ctcacgattt aatatataat 
151 

gtttcagggc caaattatga taaattaaaa actgaactta agaaccaaga 
201 

gatggcaact ttatttaagg ataaaaacat tgatatttat ggtgtagaat 
251 

attaccatct ctgttattta tgtgaaaatg cagaaaggag tgcatgtatc 
301 

tacggagggg taacaaatcg tgaagggaat catttagaaa ttcctaaaaa 
351 

gatagtcgtc aaagtatcaa tcgatggtat acaaagccta tcatttgata 
401 

ttgaaacaaa taaaaaaatg gtaactgctc aagaattaga ctataaagtt 
451 

agaaaatatc ttacagataa taagcaacta tatactaatg gaccttctaa 
501 

atatgaaact ggatatataa agttcatacc taagaataaa gaaagttttt 
551 

ggtttgattt tttccctgaa ccagaattta ctcaatctaa atatcttatg 
601 

atatataaag ataatgaaac gcttgactca aacacaagcc aaattgaagt 
651 

ctacctaaca accaagcaac cagttgttaa atctctcctt gattcaaaag 
701 

gcattcatta caatcaaggt aacccttaca acctattgac acctgttatt 
751 

gaaaaagtaa aaccaggtga acaatctttt gtaggtcaac atgcagctac 
801 

aggatgtgtt gctactgcaa ctgctcaaat tatgaaatat cataattacc 
851 
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ctaacaaagg gttgaaagac tacacttaca cactaagctc aaataaccca 
901 

tatttcaacc atcctaagaa cttgtttgca gctatctcta ctagacaata 
951 

caactggaac aacatcctac ctacttatag cggaagagaa tctaacgttc 
1001 

aaaaaatggc gatttcagaa ttgatggctg atgttggtat ttcagtagac 
1051 

atggattatg gtccatctag tggttctgca ggtagctctc gtgttcaaag 
1101 

agccttgaaa gaaaactttg gctacaacca atctgttcac caaatcaacc 
1151 

gtagcgactt tagcaaacaa gattgggaag cacaaattga caaagaatta 
1201 

tctcaaaacc aaccagtata ctaccaaggt gtcggtaaag taggcggaca 
1251 

tgcctttgtt atcgatggtg ctgacggacg taacttctac catgttaact 
1301 

ggggttgggg tggagtctct gacggcttct tccgtcttga cgcactaaac 
1351 

ccttcagctc ttggtactgg tggcggcgca ggcggcttca acggttacca 
1401 

aagtgctgtt gtaggctag 



SEQ ID NO: 24 

Full-length SpeB polypeptide (Kagawa 

1 MNKKKLGIRL LSLLALGGFV LANPVFADQN 

61 DIKLDKVNLG GELSGSNMYV YNISTGGFVI 

121 MESYVEQIKE NKKLDTTYAG TAEIKQPWK 

181 QSFVGQHAAT GCVATATAQI MKYHNYPNKG 

2 41 NWNNILPTYS GRESNVQKMA I S E LMADVG I 

3 01 SVHQINRGDF SKQDWEAQID KELSQNQPVY 
3 61 GVSDGFFRLD ALNPSALGTG GGAGGFNGYQ 



et al., PNAS 97:2235-2240. 2000) 

FARNEKEAKD SAITFIQKSA AIKAGARSAE 

VSGDKRSPEI LGYSTSGSFD ANGKENIASF 

SLLDSKGIHY NQGNPYNLLT PVIEKVKPGE 

LKDYTYTLSS NNPYFNHPKN LFAAISTRQY 

SVDMDYGPSS GSAGSSRVQR ALKENFGYNQ 

YQGVGKVGGH AFVIDGADGR NFYHVNWGWG 
SAWGIKP 
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SEQ ID NO: 25 

SpeB clone used for fusion, mature polypeptide. Estimated M r = 28.75 kDa 

1 QPWKSLLDS KGIHYNQGNP YNLLTPVIEK VKPGEQSFVG QHAATGCVAT 

51 ATAQIMKYHN YPNKGLKDYT YTLSSNNPYF NHPKNLFAAI STRQYNWNNI 

101 LPTYSGRESN VQKMAISELM ADVGISVDMD YGPSSGSAGS SRVQRALKEN 

151 FGYNQSVHQI NRSDFSQDWE AQIDKELSQN QPVYYQGGKV GGHAFVIDGA 

2 01 DGRNFYHVNW GWGGVSDGFF RLDALNPSAL GTGGGAGGFN GYQSAWG 

SEQ ID NO: 26 

L42R SpeA mutant clone used for fusion. Estimated M r = 2 5.2 kDa 

1 MQQDPDPSQL HRSSLVKNLQ NIYFLYEGDP VTHENVKSVD QLRSHDLIYN 

51 VSGPNYDKLK TELKNQEMAT LFKDKNIDIY GVEYYHLCYL CENAERSACI 

101 GGVTNREGNH LEIPKKIWK VSIDGIQSLS FDIETNKKMV TAQELDYKVR 

151 KYLTDNKQLY TNGPSKYETG YIKFIPKNKE SFWFDFFPEP EFTQSKYLMI 
2 01 YKDNETLDSN TQIEVYLTTK 



SEQ ID NO: 27 

SpeA [L42R]-SpeB [C47S] fusion. Estimated M r = 54 kDa 

1 MQQDPDPSQL HRSSLVKNLQ NIYFLYEGDP VTHENVKSVD QLRSHDLIYN 

51 VSGPNYDKLK TELKNQEMAT LFKDKNIDIY GVEYYHLCYL CENAERSACI 

101 GGVTNREGNH LEIPKKIWK VSIDGIQSLS FDIETNKKMV TAQELDYKVR 

151 KYLTDNKQLY TNGPSKYETG YIKFIPKNKE SFWFDFFPEP EFTQSKYLMI 

201 YKDNETLDSN TQIEVYLTTK QPWKSLLDS KGIHYNQGNP YNLLTPVIEK 

251 VKPGEQSFVG QHAATGCVAT ATAQIMKYHN YPNKGLKDYT YTLSSNNPYF 

3 01 NHPKNLFAAI STRQYNWNNI LPTYSGRESN VQKMAISELM ADVGISVDMD 

3 51 YGPSSGSAGS SRVQRALKEN FGYNQSVHQI NRSDFSQDWE AQIDKELSQN 

401 QPVYYQGGKV GGHAFVIDGA DGRNFYHVNW GWGGVSDGFF RLDALNPSAL 
551 GTGGGAGGFN GYQSAWG 



SEQ ID NO: 28 

1. SpeA forward -primer , including Ndel site: 

5' GATATACATATGCAACAAGACCCCGATCCAAGCC 3' 3 4-mer 

SEQ ID NO : 29 

2. SpeA reverse primer : kills Ndel site, adds SpeB overlap: 



5' CATGTGTATATCTCCTTCCTTGGTTGTTAGGTAGAC 3' 3 6-mer 
SEQ ID NO: 30 

3. SpeB forward primer : kills Ndel site, adds SpeA overlap: 
5' GT C T A C C T A A C A A C C AA G G A AG GAG AT AT A C A CAT G 3' 3 6-mer 



SEQ ID NO: 31 

4. SpeB reverse primer : adds stop site (Amber) and maintains BamHI site: 
5' GAATTCGGATCCGCTAGCCTACAACAG 3 1 27-mer 



